Checkpointing and Rollback Recovery in Object-based Systems
Authors
Abstract
In distributed applications such as teleconferences, a group of autonomous objects must cooperate to achieve some objective. An object o is modeled as a pair of a data structure and a collection of operations. On receipt of a request message carrying an operation op, o computes op and sends back a response message with the result of op. During the computation of op, op may invoke operations on other objects, i.e. the operations are nested.
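The object model in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the names `ObjectNode`, `Request`, `Response`, `roster`, `conference`, `add_member`, and `join` are hypothetical, and message passing is simulated with direct in-process calls rather than a network. It shows only an object as a data structure plus operations, the request/response exchange, and one nested invocation.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Tuple


@dataclass
class Request:
    """Request message naming the operation op and its arguments."""
    op: str
    args: Tuple[Any, ...] = ()


@dataclass
class Response:
    """Response message carrying the result of op."""
    op: str
    result: Any


class ObjectNode:
    """An object o: a data structure (state) plus a collection of operations."""

    def __init__(self, name: str, state: Dict[str, Any]) -> None:
        self.name = name
        self.state = dict(state)
        self.operations: Dict[str, Callable[..., Any]] = {}

    def register(self, op_name: str, fn: Callable[..., Any]) -> None:
        self.operations[op_name] = fn

    def receive(self, request: Request) -> Response:
        # On receipt of a request, compute op in o and send back the result.
        fn = self.operations[request.op]
        return Response(request.op, fn(self, *request.args))


# Hypothetical teleconference example: `join` on the conference object
# invokes `add_member` on the roster object, so the operations are nested.
roster = ObjectNode("roster", {"members": []})


def add_member(o: ObjectNode, user: str) -> int:
    o.state["members"].append(user)
    return len(o.state["members"])


roster.register("add_member", add_member)

conference = ObjectNode("conference", {"joined": 0})


def join(o: ObjectNode, user: str) -> str:
    reply = roster.receive(Request("add_member", (user,)))  # nested invocation
    o.state["joined"] = reply.result
    return f"{user} joined ({reply.result} present)"


conference.register("join", join)

response = conference.receive(Request("join", ("alice",)))
print(response.result)
```

Because `join` changes the state of both objects, a recovery scheme would have to treat the two states together. That dependency is what makes nested invocations relevant to checkpointing.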
Similar resources
An Application-Transparent, Platform-Independent Approach to Rollback-Recovery for Mobile Agent Systems
This paper proposes a new approach to rollback-recovery for mobile-agent systems and describes its implementation in the MESSENGERS mobile agents system. The checkpointing method used allows space- and time-efficient, user-transparent rollback-recovery to be implemented in heterogeneous distributed environments. Together with an efficient non-blocking system snapshot algorithm this checkpointing met...
Survey of Backward Error Recovery Techniques for Multicomputers Based on Checkpointing and Rollback
For implementing fault-tolerance in multicomputer systems, backward error recovery, based on checkpointing and rollback, is often used. During failure-free operation, the process states are regularly saved, and after a fault is detected, the system is rolled back to a previously saved state. We can distinguish four classes of techniques: semi-automatic techniques, message logging, coordinated ch...
Checkpointing and Rollback Recovery in Distributed Systems: Existing Solutions, Open Issues and Proposed Solutions
Checkpointing and rollback recovery are well-established techniques for dealing with failures in distributed systems. In this paper, we briefly summarize the existing solution approaches to these problems and also discuss the open issues, suggested approaches and some preliminary work that we have done addressing the open issues.
Coordinated Checkpointing-Rollback Error Recovery for Distributed Shared Memory Multicomputers
Most recovery schemes that have been proposed for Distributed Shared Memory (DSM) systems require unnecessarily high checkpointing frequency and checkpoint traffic, which are sensitive to the frequency of interprocess communication in the applications. For message-passing systems, low overhead error recovery based on coordinated checkpointing allows the frequency of checkpointing to be determin...
Blocking and Non-blocking Checkpointing and Rollback Recovery for Networks-on-Chip
In this paper we propose a dynamically reconfigurable failure recovery scheme developed for Network-on-Chip (NoC) based systems. The recovery scheme is based on a checkpointing and rollback protocol and permits enhancing the system fault tolerance capabilities by exploiting information on traffic load and failure rate. The increased performance of the fault tolerance mechanism is achieved by si...
Journal title:
Volume, issue:
Pages: -
Publication date: 1996